Chinese Hedge Scope Detection Based on Structure and Semantic Information

نویسندگان

  • Huiwei Zhou
  • Junli Xu
  • Yunlong Yang
  • Huijie Deng
  • Long Chen
  • Degen Huang
چکیده

Hedge detection aims to distinguish factual and uncertain information, which is important in information extraction. The task of hedge detection contains two subtasks: identifying hedge cues and detecting their linguistic scopes. Hedge scope detection is dependent on syntactic and semantic information. Previous researches usually use lexical and syntactic information and ignore deep semantic information. This paper proposes a novel syntactic and semantic information exploitation method for scope detection. Composite kernel model is employed to capture lexical and syntactic information. Long shortterm memory (LSTM) model is adopted to explore semantic information. Furthermore, we exploit a hybrid system to integrate composite kernel and LSTM model into a unified framework. Experiments on the Chinese Biomedical Hedge Information (CBHI) corpus show that composite kernel model could effectively capture lexical and syntactic information, LSTM model could capture deep semantic information and their combination could further improve the performance of hedge scope detection.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method

Hedge detection is used to distinguish uncertain information from facts, which is of essential importance in biomedical information extraction. The task of hedge detection is often divided into two subtasks: detecting uncertain cues and their linguistic scope. Hedge scope is a sequence of tokens including the hedge cue in a sentence. Previous hedge scope detection methods usually take all token...

متن کامل

Exploiting Multi-Features to Detect Hedges and their Scope in Biomedical Texts

In this paper, we present a machine learning approach that detects hedge cues and their scope in biomedical texts. Identifying hedged information in texts is a kind of semantic filtering of texts and it is important since it could extract speculative information from factual information. In order to deal with the semantic analysis problem, various evidential features are proposed and integrated...

متن کامل

A Cascade Method for Detecting Hedges and their Scope in Natural Language Text

Detecting hedges and their scope in natural language text is very important for information inference. In this paper, we present a system based on a cascade method for the CoNLL-2010 shared task. The system composes of two components: one for detecting hedges and another one for detecting their scope. For detecting hedges, we build a cascade subsystem. Firstly, a conditional random field (CRF) ...

متن کامل

Exploiting Rich Syntactic Features for Hedge Detection and Scope Finding∗

Hedge detection and scope finding are increasingly important tasks in information extraction, especially in the biomedical natural language processing community. In this paper, a novel approach detecting hedge cues and their scopes by sequence labeling is explored. It should be emphasized that syntactic dependencies are systematically exploited and effectively integrated by a large-scale featur...

متن کامل

Hedge Detection Using the RelHunter Approach

RelHunter is a Machine Learning based method for the extraction of structured information from text. Here, we apply RelHunter to the Hedge Detection task, proposed as the CoNLL-2010 Shared Task1. RelHunter’s key design idea is to model the target structures as a relation over entities. The method decomposes the original task into three subtasks: (i) Entity Identification; (ii) Candidate Relatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016